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Gene expression from the Moloney murine leukemia retrovirus (Mo-MuLV) is highly restricted in embryonic 
carcinoma (EC) and embryonic stem (ES) cells. We compared levels of expression in PA317 fibroblasts, F9 
(EC) cells, and CCE (ES) cells by Mo-MuLV-based vectors and vectors based on our previously reported MND 
backbone, which has alterations to address three viral elements implicated as repressors of expression by 
Mo-MuLV: the enhancer, the primer binding site, and the negative-control region. Expression was evaluated 
with three reporter genes, the chloramphenicol acetyltransferase (CAT) gene, whose expression was measured 
by enzymatic assay and by Northern blotting; a truncated nerve growth factor receptor (tNGFR), whose 
expression was measured by fluorescence-activated cell sorting (FACS) as a cell surface protein; and the 
enhanced green fluorescent protein (EGFP), whose expression was measured intracellularly by flow cytometry. 
We found significantly higher levels of CAT activity (5- to 300-fold) and greater quantities of vector-specific 
transcripts in ES and EC cells transduced with the modified MND-CAT-SN vector than in those transduced 
with L-CAT-SN. Northern blot analysis indicated that long terminal repeat transcripts from MND-CAT-SN are 
>80 times more abundant than the L-CAT-SN transcripts. FACS analysis of tNGFR expression from a pair 
of vectors, L-tNGFR-SN and MND-tNGFR-SN, indicated that only 1.04% of the CCE cells containing the 
L-tNGFR-SN vector expressed the cell surface reporter, while the MND-tNGFR-SN vector drove expression in 
99.54% of the CCE cells. Of the F9 cells containing the L-tNGFR-SN vector, 13.32% expressed tNGFR, while 
99.89% of the F9 cells transduced with MND-tNGFR-SN showed expression. Essentially identical results were 
produced with an analogous pair of vectors encoding EGFP. In unselected pools of F9 cells 48 h posttrans- 
duction, the L-EGFP-SN vector drove expression in only 5% of the population while the MND -EGFP- SN vector 
drove expression in 88% of the cells. After more than 3 weeks in culture without selection, the proportion of 
cells showing expression from L-EGFP-SN decreased slightly to 3% while expression from the MND-EGFP-SN 
vector persisted in 80% of the cells. Interestingly, in the few ES and EC cells which did show expression from 
the L-tNGFR-SN or L-EGFP-SN vectors, the magnitude of reporter expression was similar to that from the 
MND-tNGFR-SN or MND-EGFP-SN vector in nearly all cells, suggesting that the MND vectors are far less 
susceptible to position-dependent variegation of expression than are the Mo-MuLV -based vectors. Therefore, 
the modified retroviral vector, MND, achieves higher net levels of expression due to a greater frequency of 
expression, which may be useful for the expression of exogenous genes in EC and ES cells. 



Moloney murine leukemia virus (Mo-MuLV)-based vectors 
are the predominant gene transfer vehicles used for stable 
transduction of mammalian cells in vitro, despite some recog- 
nized disadvantages (6a, 33). Gene expression from the Mo- 
MuLV long terminal repeat (LTR) has been disappointing in 
murine embryonic stem (HS). embryonic carcinoma (HO. and 
hematopoietic cells (8, 29) as well as in human hematopoietic 
cells (1). 

Persistent high-level expression from retroviral vectors in ES 
and EC cells could be an extremely useful tool for the evalu- 
ation of the effects of exogenous genes on mammalian cell 
development. For example, ES cells can be induced to undergo 
differentiation into cells of the hematopoietic lineages; efficient 
retrovirus-mediated gene transfer and expression in ES cells 
would provide a model system with which to evaluate the 
effects of genes which regulate hematopoietic cell differentia- 
tion. Reliable expression of genes introduced into hematopoi- 
etic stem cells (HSC) is a central requisite for effective clinical 
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gene therapy of a variety of genetic disorders (24). Therefore, 
understanding the mechanisms which control transcription 
from the LTRs of retroviral vectors may allow the develop- 
ment of modified vectors which achieve more effective expres- 
sion in stem cells. 

The enhancer/promoter of the Mo-MuLV has been shown 
to be inactive in EC and ES cells (4, 18, 20, 27). A related virus, 
the myeloproliferative sarcoma virus (MPSV) (25), arose as a 
spontaneous mutant of the Moloney murine sarcoma virus 
(MSV), which caused myeloproliferative disorders instead of 
sarcomas. Several groups have demonstrated that MPSV has 
greater transcriptional activity than Mo-MuLV in EC cells (20, 
21, 50) and in hematopoietic cells (4, 39, 46). The LTR of 
MPSV has several base-pair changes compared to the LTR 
sequences of Mo-MuLV and MSV, with one single -base-pair 
alteration in the enhancer of MPSV creating a consensus bind- 
ing site for the Spl transcription factor; this new Spl site was 
shown to be responsible for the altered pathogenic pattern of 
MPSV and increased transcriptional activity in EC cells (41). 

By constructing and selecting for recombinant retroviruses 
which had enhanced infectivity and expressivity in EC cells, 
Barklis et al. (2), Loh et al. (28), and Weiher et al. (50) 
determined that sequences contained within the 5'-untrans- 
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lated region immediately downstream of the 5' LTR in or near 
the primer binding site (PBS) also inhibited I.TR-directed 
expression. Barklis et al. (2) found that a single point mutation 
in the PBS of the B2 proviral recombinant was sufficient to 
permit expression from a Mo-MuLV template in EC cells. 
Subsequent research efforts identified a frans-acting factor 
termed the repressor binding protein which bound to the Mo- 
MuLV PBS and interfered with transcription from the LTR 
(14, 23, 40). An endogenous murine retrovirus, d/587rev, was 
shown to escape the PBS-associated repression by utilizing a 
variant PBS which has multiple base pairs divergent from Mo- 
MuLV that eliminate the PBS-associated repressor-binding 
site and result in the utilization of a tRNA G ' u instead of a 
1RNA 1 '"' lor the priming of reverse transcription (1 1. 40, 47). 

Transcriptional repression of Mo-MuLV in ES and EC cells 
is mediated, in part, by the binding of the transcription factor 
YY1 (also known as UCRBP or NCRBP) to a highly conserved 
sequence (5'-TAAC( iCCATTT-3') in the 5' LTR termed the 
negative-control region (NCR) (3, 15, 16, 48). 

Repression of expression of endogenous and exogenous 
gene sequences in mammalian cells is frequently associated 
with methylation of cytosine residues in DNA (5, 7). Although 
it is not clear whether cytosine methylation is a primary cause 
of transcriptional silencing or merely an associated result, the 
correlation between the two events suggests an active role for 
methylation in the down regulation or maintenance of tran- 
scriptional repression of many genes. Mo-MuLVs are prone to 
extensive de novo methylation in EC cells (38, 45), as well as in 
murine fibroblast cells (22). Challita and Kohn (8) and, re- 
cently, Lengauer et al. (26) demonstrated a strong correlation 
between methylation of the Mo-MuLV LTR and the lack of 
LTR-directed transcription in murine HSC and human colo- 
rectal cancer cells, respectively. 

We previously described a novel retroviral vector, Mp-ncr- 
dl-neo (herein called MND-neo) which contains modifications 
of the LTR enhancer, the NCR, and the PBS (9). MND-neo 



was shown to have greatly increased expression in a titer assay 
with F9 EC cells compared to a parallel vector containing the 
standard elements from Mo-MuLV. To further examine the 
triply modified vector MND (MPSV LTR, NCR deleted, rf/587 
PBS), we constructed the MND-X-SN backbone, which al- 
lowed convenient insertion of different reporter genes. We 
analyzed chloramphenicol acetyltransferase (CAT) enzyme ac- 
tivity, truncated nerve growth factor receptor (tNGFR) surface 
reporter expression, and enhanced green fluorescent protein 
(EGFP) expression from the Mo-MuLV and MND vector 
backbones in ES, EC, and fibroblast cells. 



MATERIALS AND METHODS 

Cell lines. PA317 and F9 cell lines wen obtained from the American Type 
Culture Collection. 1 he PA317 atnphotropic packaging o II-. (35), derived from 
Mil 3 13 fibroblasts, were grown in Dulbccco's modified Lack's medium 
(DMLMI containing 10'', fetal call serum (ICS), penicillin-streptomycin (SO 
U/ml), and L-glutamine (2 mMl. I he I 9 EC cell line was maintained on gelatin- 
coated tissue cultuie plates in I >MIM supplemented with 15'V. ICS, penicillin- 
sticplomvcin (50 L ml), i-glul. inline (2 m\t), and 0.1 m\t (final concentration) 
p-mercaptoethanol. The CCL I S cell line (13) was generously provided bv 
Jctlrc\ Mann (Cil\ of I lope Medical Center, 1 luartc, Calif.). I he ( CL cells were 
propagated in the same medium as the F9 cells, with the addition of recombinant 
leukemia inhibitory factor (0.1 ng/ml: R+D Systems, Minneapoli Min i i > 

(31), a generous gift from Arthur Bank, Columbia University, New York, N.Y., 
were kept under selection in HXM medium (DMEM supplemented with 10% 
newborn calf serum, h\ poxanlhine |I5 |j-gml|, xanthine |250 u.g/ml|, and myco- 
phenolic acid |25 u.g ml|) until viral supernatant was needed, at which point the 
medium was changed to I iVILVl supplemented with 10'V. ICS, penicillin-strep- 
tomycin, and i -glu Lam inc. 

Vectors. The LN vector was constructed in the laboratory of A. Dusty Miller 
(34). The L-CAT-SN vector was constructed by insertion of a fragment from 
pSV2-CAT encoding the bacterial C.YI gene into the Hpa\ site of the LXSN 
vector plasmid (Fig. 1) (34). 

The construction of MND-neo was previously described by Challita et al. (9). 
To construct a multipurpose MND vector backbone, the multicloning site and 
the simian virus 40 (SV40) promoter from LXSN were cloned in front of neo in 
MND-neo to generate the vector MXD-X-SX, where "X" represents unique 
EcoRl.Hpal, and \'hn\ cloning sites. MXD-CAT-SX was assembled by cleaving 
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the MND-neo vector upstream of the neo gene al S /I and inserting a Hindlll 
fragment containing the CAT gene and SV40 promoter from L-CAT-SN (Fig. 1). 

[wo pairs of \ectors which incorporate out of wo fluorescence-activated cell 
sorting (f-ACS)-dclcclablc rcpoitci acnes, cither that a truncated form of rat 
nerve growth factor receptor (INCH R) or that lot the enhanced green lluorcs- 

of the Mo-MuLV-driven vector, L-tNGFR-SN, are described in Malik et al. (30). 
The modified version, MND-tNGFR-SN, was constructed by inserting a 940-bp 
EcoRl-Xhol fragment containing the tNGFR gene from L-tNGFR-SN into the 
same sites in the MND-X-SN backbone (Fig. 1). The L-EGFP-SN and MND- 
liC il I'-SN vectors were const meted b\ insciting />VI!-.\".'I fragments containing 
the EGFP gene (Clontech Laboratories, Palo Alto, Calif.) into theHpal sites in 
the polyl inkers of LXSN and MND-X-SN. respectively (Fig. 1). 

Packaging of retroviral vectors. A. Dusty Miller supplied a clone of PA317 
packaging cells producing the I.N vccloi at a hisjh liter (10"'ml) (35). The 
L-CAT-SN, MND-CAT-SN, L-tNGFR-SN, MND-tNGFR-SN, L-EGFP-SN, 
and MND-EGFP-SN vectors were packaged by transfecting their vector plasmids 
into GP+E-86 cells (31). Ecotropic viral supernatant was used to generate 
C 1418-rcsislant PA3I7 pools from which ampholropic virus supernatant was 
collected as described by Challita et al. (9). 

cells under the same conditions as those for the fibroblasts. F9 and CCE cell 
pools were selected in ( 14 1 .s (500 |xg ml ) lot appi oximaklv 14 da\s, aflet which 
Ihcv were cultured in DM1 Al supplemented w ith 15', 1-CS, penicillin-strepto- 
mycin (50 U/ml), L-glutamine (2 mM), and 0.1 mM (final concentration) p- 

inhibilory factor (0.1 ng/ml). Cells were assayed lot C A ' activity, INCLR, oi 
li( II P expression while in loi> phase of mow th aflet al least 74 h without (1418, 
which was observed to interfere with the CAT assay. 

Protein extraction and quantitation. ( ell extracts were made from cells har- 
vested by Irvpsiniz.alion during log phase ol growth. Cell pellets were washed 
twice in Hanks balanced salt solution, and 10'' cells were resuspended in 0.1 ml 
of 0.1 M 1 l is I K 1 (pi I 7.81 and Ivsed with three freeze thaw cycles in a dry 
ice-ethanol bath. The Iv sales wet e ccnliilugccl at 3,000 c lot 10 min to icninw 
the membrane debris. I he remaining liquid w as transferred to an Hppendorf 
tube and incubated al 70 ( ' for 15 min to desiio\ a cellulat piotein that inlet feres 
with the CAT assay (37). After heat treatment, the lysate was centrifuged again 
at 12,000 X g for 10 min to remove additional proteins. The second centrifuga- 
lion step was ctucial lot i epi oclucibil il\ and maximal C.\ 1 activity. 

To standardize the amount of cellular protein in the lysates for the CAT assay, 
we utilized the BCA Protein Assay quantitation kit (Pierce, Rockford, 111.). Two 
microliters of each sample was checked against a bov ine serum albumin standard 
curve prepared for each experiment. Samples for analysis were standatdi/ed at 
900 |u.g of total protein and assayed in duplicate or triplicate. 

CAT diffusion assay. CA I assays weie pel loimecl In a modified version of the 
method published by Neumann el al. (37). Alterations to the protocol include the 
addition of a ccnli ilugalion step aflei the heat inaclivation of the lysate, an 
increase in the sample volume and total aqueous phase volume from 50 to 100 
jjlI . and the use of 5-ml plastic scintillation \ ials instead of 7-ml glass \ ials. I he 
positive control for the assay was a dilution (0.028 jjlI sample) of recombinant 
CAT enzyme (Pharmacia, Uppsala, Sweden). 1 he negative control consisted of 
900 |ji,g of protein extracted from cells containing the I.N v ector that does not 
encode ( 'A I . I he lev els of CAT activ ity expressed by the cells w ith the I N v ectoi 
were considered background and weie subtracted from the test sample results. 

I I ilh il I I Hs-l'lCI (pfl v. iiich ' i pi,,. o,o li, il |l Hon 

vial. I he reaction cocktail (200 jjj sample) was preprcparcd with 25 jjlI of I M 
I ris (pll 7.8)-25 ul of 10 mM chloramphenicol (Sigma. St. Louis, Mo. 1-0.2 ,jcl 
per sample of O.S-juut "i/juul |'ll|acetvl coenzyme A (Dupont, N V ISoston, 
Mass. 1-140.8 uJ of double-distilled I I .() and added to the lysates in the scintil- 
lation vials. I he samples were vortexed, and 47 ml of I conofluoi scintillation 
fluid (Dupont. NL'N ) was ov erlaid. I he samples weie placed in a shaking water 
bath (37 C) for up to 18 h. The numbers of counts per minute were monitored 
periodically, starling at I h. for activity within the lineal range of the assay. I he 
monitoring was performed bv scintillation counting with an I S 5800 (ISeckman 
Instruments, Palo Alto, Calif.), fhe most consistent results were achieved at 
approximately 12 h. 

Antibody staining and flow cytometry. For analysis of cell surface expression of 
the INC SI R reporter. ( '( 17 L0, and PA3P cells were haivesled with enzyme-free 
cell dissociation buffer (Gibco-BRL, Bethesda, Md.) in log phase of growth, 
washed twice in RPMI 1040. resuspended al 10" cells ml in phosphate-buffered 
saline, and placed on ice. One million cells were stained for each sample, and all 

111 I I II I i I 

incubating lor 30 min. I lien. 5 uJ of anti-rat NCI R monoclonal antibody MCI92 
(fOO (eg/ml; Oncogene Science, Lniondale, X.Y.I or the negative control (mu- 
rine IgG) wei e added, and the cells were incubated for 30 min. Excess antibody- 
was removed by two washes with phosphate-buffered saline. The fluorescent 
second antibody, phvcoeivlhi in-labeled goal ami-mouse antibody (5 u.1 I o" celb 
of a 1:20 dilution of the slock [Caltag, San Francisco, Calif.]), was added. After 



TABLE 1. CAT activity in cell lysates 



Cell type («) 


Avg CAT activity (cpm)" 


= SEM ('V control*) from: 


L-CAT-SN 


MND-CAT-SN 


PA317 (10) 


99,182 ± 11,586 (100) 


112,232 ± 10,666 (100) 


CCE (7) 


11,876 ± 2,960 (12) 


54,240 ± 12,349 (48) 


F9(3) 


122 ± 62 (0.1) 


38,122 ± 3,439 (34) 



"Activity is expressed as the number of counts per minute of 3 H-labeled 
acelvlated chloramphenicol produced bv Ivsates of ( VI , 10, or PA3I7 cells 
com, lining the indicated vecloi minus the baekgiound counts pel minute in cells 
transduced with the I.N vector from the same experiment. 

bc c control, irumbei of counts pel minute ol il-kibeled aeelv filed chloram- 
phenicol produced bv Iv sales of CC T or 10 cells div ideel bv the number of counts 
per minute of 4I4abeled acelvlated chloramphenicol piodueed bv Ivsates of 
transduced PA317 cells times 100' , . 



I p n if mi il hv i S I II eie i 1 1 li in cl ith a 1 ' Si in II 

the staining procedure and to set the gates loi data acquisition, sham-transduced 
cells weie prepared in parallel and analyzed loi each experiment. 

lequilecl unseleclecl I 0 111 opsini I eithel 48 

posi transduction, w ashed twice in phosphate -bulleied saline, and lesuspended al 
a concentration of 10 7 /ml in phosphate-buffered saline. Detection of GFP ex- 
pression was accomplished on a I AC Scan cylomelei equipped with a 488-nm 
argon laser for excitation ol the lepoilei piolein and a 5 Ml 30 nm bandpass filler 
loi monitoring the lluoiescenl emissions, l xpeiimenl.il background was estab- 
lished by including non-EGFP (LN)-transduced cells in the analysis. 

Northern and Southern blot analyses. Genomic DNA and total cellular RNA 
were extracted from cell pellets of reliov irallv transduced PA3P, CCL, and 10 
cells loi Southern and Noilhein blot analysis. ( ienomic I )NA was extracted by 
sodium dodecvl sulfate (SI )S)-proteinase K digestion overnight at 37°C in pro- 
teinase K buffer containing 0.01 M Tris-HCl (pH 7.4), 0.15 M NaCl, 0.02 M 
LIMA (pll 8.0), 0.08'V SDS, 50 pj of proteinase K (10 mg/ml) (Gibco-BRL), 
and 5 u.1 of RNAse (Gibco-BRL) per ml. Protein was removed by phenol- 
chloroform extraction, and the DNA was piecipiiated in elhanol and resus- 
pended in Tris-EDTA buffer (43). 

Quantitative Southern blots weie pel loimecl to cleieimine proviral copy num- 
ber as described bv Challita el al. (9). I he intensities of the proviral /fj-act in 
bands were compared with a ( ,S. Hiochemicals (Cleveland, C )hio) Sci-Scan 5000 
densitometer. 

Southern blots (also described in reference 9) were used to determine the 
extern of proviral methylation at the Sma\ site in the R legion of the I I R (f ig. 
1). The extent of proviral melhylalion was assessed with clensilomeliy by com- 
pai ing the intensity ol the unique band ciealed by Smul digestion to the inlensily 
of the band created by the largci .VmM-icsistant fragment. 

RNA was exti acted hom the cells with guanadinium isothiocyanate and phe- 
l-chl I III I II 

lianslei led to a nylon membrane via capillary action. Nucleic acids were cross- 
linked to the membrane with a ( V So , Hal inker (Snalagcnc, La Jolla, Calif.). 
I he nylon membrane was then hybridized to eithel ni-im or fi-actin cDNA probe 
labeled with [ 32 P]dCTP incorporated with a Prime-It random primer kit from 
Stratagene. The membrane was washed as outlined above and then used to 
expose Kodak X-Omat X-ray film. 



RESULTS 

Expression by Mo-MuLV is restricted in EC and ES cells. 
Observations in our laboratory by Challita et al. (9) indicated 
that the MND vector increased the expression of the neomycin 
resistance gene in F9 cells. Therefore, MND was selected as 
the backbone for the novel vectors in this study. To accurately 
quantitate relative gene expression levels from the standard, 
Mo-MuLV-based vectors (L-CAT-SN, L-tNGFR-SN, and L- 
EGFP-SN) and our novel, modified constructs (MND-CAT- 
SN, MND-tNGFR-SN, and MND-EGFP-SN) (Fig. 1) in F9 
EC, CCE ES, and PA317 cells, we compared CAT enzyme 
activities and levels of tNGFR protein or EGFP production. 

Measurement of CAT activity. Table 1 shows the average 
CAT assay results measured for each vector in seven indepen- 
dent experiments on cell pools from five separate transductions 
of CCE cells and three experiments from two transductions of 
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1 2 3 4 5 6 7 8 9 
bid. 2. Determination b\ Southern blotting of tlx- cop\ numbers of (A 1 
vectors in b'9 evils, i otal genomic UNA from two sets of separate!) transduced 
IV cells (labeled A and li) was digested with restriction en/\mes to release 
discrete vector fragments and was electropboresed on a 1 .2' , agarose gel. Stan- 
dard dilution series representing 3, 2, and I \ vet or copies are shown in lanes 2, 
3, and 4. respect i\ cl\ . R-aclin probing of the gel was used as a UNA loading 
control (data not shown: see the text for an explanation!. Densitometry lesults 
are included in the text. Lane (-), negative control containing DNA from 
nontransduced cells. KBL, kilohase ladder. 



F9 cells. The background in the CAT assay was established as 
the level of CAT activity measured in the CCE or F9 target 
cells transduced with the LN vector. The percent of control was 
defined as the level of CAT expression (in counts per minute 
with the background subtracted) from each vector in CCE or 
F9 cells divided by the level of expression for the same vector 
in PA317 fibroblast cells in the same experiment. The numbers 
of counts per minute (with the background subtracted) are 
shown for each vector in all three cell types. 

In the permissive PA317 cells, the L-CAT-SN vector showed 
an average CAT activity (± standard error of the mean [SEM]) 
of 99,182 (±11,586) cpm. MND-CAT-SN gave an average of 
112,232 (±10,666) cpm in the PA317 cells, which is similar to 
the value from L-CAT-SN (Student's t test result: P = 0.44). 
However, when the vectors were tested in CCE ES cells, which 
are restrictive to expression from the Mo-MuLV LTR, we 
found that L-CAT-SN produced an average of only 11,876 
(±2,960) cpm. In contrast, the MND-CAT-SN vector pro- 
duced an average of 54,240 (±12,349) cpm in the CCE cells, 
which is significantly greater than the value for the L-CAT-SN 
vector (P = 0.0031). Vector MND-CAT-SN achieved a level of 
expression in the CCE ES cells that was 48% of that in PA317 
cells, while vector L-CAT-SN produced only 12% of the activ- 
ity in CCE cells as in PA317 cells. 

In F9 cells, repression of the Mo-MuLV LTR appears to be 
more extreme than in CCE cells, with an average CAT activity 
of only 122 (±62) cpm from L-CAT-SN. In contrast, MND- 
CAT-SN produced a significantly greater average CAT activity 
of 38,122 (±3,439) cpm than did L-CAT-SN (P = 0.00038). 
While the relative expression pattern between the Mo-MuLV- 
based vector and the MND construct is maintained in the F9 
EC cells compared with the results for CCE ES cells, the 
absolute levels of expression are decreased. In the F9 EC cells, 
the MND-CAT-SN vector produced 34% of the activity seen in 
the PA317 cells, while the L-CAT-SN vector produced only 
0.1% of the CAT activity seen in the PA317 cells. Therefore, 
MND-CAT-SN produced 4 and 340 times more CAT activity 
than the L-CAT-SN vector in CCE and F9 cells, respectively. 

Quantitation of proviral integrants. In order to allow the 
comparison of gene expression on a copy-per-cell basis, it was 
necessary to determine the number of proviral integrants in the 
target cells. Quantitative Southern blotting was performed on 
DNA from pools of transduced CCE and F9 cells. To prepare 
the samples for Southern blotting, phenol-extracted DNA was 
cut with restriction enzymes that recognize sites within the 
vectors, thereby releasing a discrete fragment regardless of 



integration site or copy number. The standard dilution series 
on the left of Fig. 2 (lanes 2 to 4) are derived from a cell line 
known to contain a single proviral integrant. The dilutions in 
lanes 2 to 4 represent the equivalent of three vector copies, two 
vector copies, and a single proviral integrant, respectively. Af- 
ter correction for the DNA loading on the gel by re-hybridiza- 
tion of the blot with a p-actin probe (data not shown) densi- 
tometric analysis of the fragments hybridized to the labeled 
neo gene probe revealed that the L-CAT-SN (lanes 6 and 7) 
and MND-CAT-SN (lanes 8 and 9) vectors were present at one 
or two copies per cell in two separately transduced pools of F9 
cells. The p-actin hybridization results are not shown because 
the different restriction enzymes required to release discrete 
vector fragments produced complicated digestion patterns 
upon reprobing. Therefore, the cumulative density measured 
for all of the p-actin complementary bands in a given sample 
was used as the denominator to extrapolate the number of 
vector integrants. Southern blot analysis of two sets of trans- 
duced CCE cells similarly showed that the L-CAT-SN and 
MND-CAT-SN vectors were present at one or two copies per 
cell (data not shown). 

Northern blot analyses of vector-derived transcripts. Total 
cellular RNA from transduced CCE cells was analyzed by 
Northern blotting to assess vector expression in the target cells 
(Fig. 3). Three transcripts may potentially be produced from 
these vectors: a full-length genomic transcript initiated in the 
5' I. I R of approximately 3.4 kb, a spliced form of the genomic 
transcript approximately 3.1 kb in length, and a transcript ini- 
tiated at the SV40 promoter approximately 1.4 kb in length, 
with all transcripts terminating at the polvadenvlalion signal of 
the 3' I.I R. 

Table 2 shows the numerical values for the relative transcript 
abundances from two separate Northern blots. Figure 3 (lanes 
1 and 2) shows strong expression of the full-length genomic 
transcript and the SV40-promoted neo message from the L- 
CAT-SN and MND-CAT-SN vectors in PA317 cells. In the F9 
cells transduced with the L-CAT-SN vector (lane 3), no mea- 
surable signal was produced for either the full-length or spliced 
LTR transcripts, although the SV40-promoted neo message 
was visible. The MND-CAT-SN vector produced easily detect- 
able signals for the singly spliced LTR transcript and the SV40 
neo transcript in the F9 cells (lane 4). The modified vector 
produced an average of 27% as much LTR-promoted RNA in 



FIG. 3. Northern blot analysis of vector-derived RNA from transduced 
PA3I7, F9, and CCE cells. I he membrane was hybridized with a "P-labeled neo 
gene probe, snipped, and reprobed with labeled p-actin cDNA to allow lot 
'n loading efficiency. Densitometry results are included in the text. 
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TABLE 2. Relative abundance of vector-derived LTR transcripts 



0.072 0.47 



MND-CAT-SN/L-CAT-SN 



" Values were normalized to the abundance ot transcripts horn L-CAT-SN 
xtoi LTR in PA317 cells, which was defined as 1.0. 
* ND, not mathematically defined. 



the F9 cells as it did in the PA317 cells and > 1,000 times more 
RNA than the L-CAT-SN vector did in the F9 cells. As ex- 
pected, no evidence of neo expression is observed in the non- 
transduced F9 cells (lane 5) or the CCE cells (lane 8). 

In the CCE cells, the L-CAT-SN vector failed to produce a 
measurable amount of full-length genomic or spliced RNA, 
although a modest signal from the SV40 neo transcript is vis- 
ible (Fig. 3, lane 6). However, in a second blot (data not 
shown), the L-CAT-SN vector produced an average of 0.14% 
of the amount of LTR-promoted RNA in the CCE cells as it 
did in the PA317 packaging cells (Table 2). Remarkably, CCE 
cells containing the MND-CAT-SN vector (lane 7) produced 
52% of the amount of LTR-driven RNA as did PA317 cells 
containing the same vector and an average of 370 times more 
RNA than did CCE cells containing the Mo-MuLV-based L- 
CAT-SN vector. Interestingly, increased levels of the SV40- 
directed transcripts from the MND-CAT-SN vector were seen 
in both the CCE and F9 cells compared to those from L-CAT- 
SN. 

Southern blot analysis of proviral methylation. Challita and 
Kohn (8) demonstrated that methylation at the Smal site in- 
side the R region of the 5' LTR (Fig. 1) is inversely correlated 
with expression from the provirus; therefore, we performed the 
same analysis on these samples. Densitometry of the blots 
confirms that the L-CAT-SN vector (Fig. 4, lane 2) and the 
MND-CAT-SN vector (data not shown) are completely un- 
methylated in PA317 cells. The Mo-MuLV LTR within the 
L-CAT-SN vector was 61% methylated in the F9 cells (Fig. 4, 



3 £ 



« 0% 



61% 



92% 



32% 



14% 



FIG. 4. Southern blot to determine the extent of pro\ iral methylation. Total 
genomic DNA from PA317, CCE, and F9 cells transduced with either the 
L-CAT-SN or MND-CAT-SN vectors was digested with Nhel (lanes 1,3, and 5 ), 
Nhel and Smal (lanes 2, 4, and 6|, E, oRV (lanes 7 and 0|, or K, oRV and Smal 
(lanes 8 and 10) to release discrete \eeloi fragments. Digested samples were 
eleclrophoresed and transferred to nylon membranes. I he membranes were 
hybridized to a 32 P-labeled neo gene and used to expose X-ray film. Densitom- 
etry was performed to quantitate the intensities of the Smal-resistant fragments 
(upper bands) and .s'»Ml-sensitivc fragmt nts (lower bands I in lanes 2, 4, 6, 8, and 
10. The proportion of the total density measured from both bands that was 
contributed by the 5mal-resistant fragment represents the extent of proviral 



lane 4) and 92% methylated in the CCE cells (Fig. 4, lane 6). 
In contrast, the MND LTR is only 32% methylated in the F9 
EC cells (Fig. 4, lane 8) and 14% methylated in transduced 
CCE cells (Fig. 4, lane 10). Therefore, the extent of methyl- 
ation of each vector showed that there was an inverse relation 
between methylation of the provirus and LTR-driven expres- 
sion in all three cell models. 

Quantitation of surface antigen and intracellular reporter 
expression in individual cells, (i) tNGFR expression. Because 
the CAT assay and Northern blot analyses assess average gene 
expression from the bulk population of cells, the contribution 
of individual cells is obscured. FACS was employed to assess 
the level of LTR-promoted expression of a surface antigen 
reporter in individual CCE or F9 cells transduced with either a 
Mo-MuLV LTR-driven vector, L-tNGFR-SN, or the modified 
version, MND -tNGFR-SN. Both vectors encode rat tNGFR, 
which is expressed on the cell surface and which can be de- 
tected with monoclonal antibody MC192 (30). Since the fluo- 
rescence intensity is proportional to the amount of protein 
expressed on the surface of each cell, FACS permits discrim- 
ination between high-level gene expression in a minority of 
cells and uniform expression in a majority of cells within a 
population. Because the cells are G418 selected, each contains 
at least one copy of one of the vectors and, therefore, has equal 
potential for tNGFR reporter gene expression. 

In the PA317 fibroblasts, the MND-tNGFR-SN vector 
showed strong expression in essentially every cell, as did the 
L-tNGFR-SN vector (Fig. 5A). Mo-MuLV-based vector L- 
tNGFR-SN performed poorly in the CCE ES cells, producing 
detectable tNGFR protein in only 1.04% of the cells (Fig. 5B). 
In contrast, the MND-tNGFR-SN vector drove expression of 
tNGFR in 99.54% of the CCE cells. In the F9 cells, the di- 
chotomy is also extreme. The L-tNGFR-SN vector expressed 
the cell surface reporter in only 13.32% of the cells, but the 
modified MND-tNGFR-SN vector expressed the reporter in 
99.89% of the F9 cells (Fig. 5C). Interestingly, the level of 
expression (fluorescence intensity) from the fraction of CCE or 
F9 cells which did show expression from L-tNGFR-SN was 
equivalent to the expression levels from MND-tNGFR-SN in 
the same cells. 

(ii) EGFP expression. To explore the possibility that G418 
selection of cell pools in the previous experiments biased our 
observations on the frequency of expression or that the results 
were construct specific, we compared expression levels of the 
GFP from the Mo-MuLV LTR and the MND LTR in pools of 
transduced F9 cells without exposure to G418 (Fig. 1). In the 
packaging cells, GFP expression from both the L-EGFP-SN 
vector and the MND-EGFP-SN vector was strong (data not 
shown). Forty-eight hours after transduction, the L-EGFP-SN 
vector drove expression in only 5% of the transduced F9 cells 
(Fig. 6). As observed with the tNGFR reporter, the MND- 
I : .( il 'P-SN vector drove expression in a large proportion of the 
cells (88%) (Fig. 6). Also in agreement with the tNGFR re- 
sults, the few cells that were expressing EGFP from the Mo- 
MuLV LTR were expressing it at the same intensity as those 
that were expressing it from the MND LTR. The titers of the 
viral supernatants used to produce these populations were 
similar (~10 6 /ml), and quantitative Southern blot analyses 
(performed as described above) indicated that the L- 
EGFP-SN and MND-EGFP-SN vectors were present in simi- 
lar quantities (~1 copy per cell) in the nonselected F9 cells 
(data not shown). 

To determine if the differences in the frequency of vector 
expression were due to variations in the kinetics of vector 
activation or inactivation, we analyzed EGFP expression 48 h 
after infection and then cultured the cells in the absence of 
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G418 for an additional 20 days and reanalyzed GFP expres- 
sion. After 9 days posttransduction, the percentages of cells 
expressing EGFP from the Mo-MuLV LTR and the modified 
MND LTR were essentially unchanged from that observed at 
48 h: 6 and 84%, respectively (Fig. 6). After 22 days in culture 
in the absence of G418, the L-EGFP-SN vector was active in 
only 3% of the population while expression from the modified 
MND-EGFP-SN vector persisted at the same magnitude in 
80% of the cells. 

DISCUSSION 

In this report, we compared expression levels of Mo-MuLV 
LTR-driven retroviral vectors and vectors with modifications in 
cis elements in restrictive ES and EC cells. The modifications 
to the MND retroviral vectors target three obstacles to gene 
expression in EC and ES cells. They consist of replacement of 
the inactive Mo-MuLV enhancer with the enhancer from 
MPSV, which is active in EC and ES cells, the removal of the 
NCR to prevent binding of YY1, and substitution of the PBS 
with the <//587 PBS to prevent binding of the inhibitory repres- 
sor-binding protein. To gauge the effectiveness of the modifi- 



l'Ki. 5. FAC -s anahsis or lN( il-R expression from I.-tNC iFR-SN and MND- 
tNGFR-SN in PA317 (A), C'C'k (15), and F9 ((') cells. 1 he y axis represents the 
number of eells expressing iNCil-'R, and [he \ axis (lluoieseenee intensity) shows 
lire lex el of expression on each cell. Nonlransduccd cells arc shown by light-gray 
lines, cells transduced with L-tNGFR-SN are shown by thin black lines with the 
included aiea shaded gray, and cells transduced with MND-INC ih'R-SN arc 
shown In [hick black lines, lixpci inienls w ere performed three limes with similar 
results. For illustration, a single representative set is shown. 



cations we have incorporated into our retroviral vectors, we 
compared LTR-driven expression of the CAT gene, the 
tNGFR cell surface antigen, and EGFP gene in pools of trans- 
duced PA317, F9 EC, and CCE ES cells. 

Relative CAT expression. As expected, since the PA317 cells 
are not restrictive to expression from either the Mo-MuLV 
LTR or the modified MND LTR, the L-CAT-SN and MND- 
CAT-SN vectors drive expression equally well in the packaging 
cells. However, when we examined vector activity in cell lines 
that are not permissive for expression from the Mo-MuLV 
LTR, we found that the modified MND LTR drove expression 
more strongly than did the unmodified LTR. In CCE cells, 
expression by MND-CAT-SN was five times higher than by 
L-CAT-SN as measured by the CAT assay. In the highly re- 
pressive F9 cells, the modified MND-CAT-SN vector produced 
-300 times more CAT activity than did the L-CAT-SN vector. 
The densitometric analyses of Northern blots from transduced 
( '( 'I and F9 eells showed ,-80-fold increases in the quantity of 
RNA from MND-CAT-SN, which suggested that the enhanced 
CAT activity is the result of an increased steady-state level of 
vector-derived RNA. 

To rule out the possibility that the large increases in expres- 
sion by MND-CAT-SN and MND-EGFP-SN were due to the 
presence of increased numbers of proviral templates, copy 
number Southern blots were performed. The blots indicated 
that the numbers of vector integrants ranged between 1 and 2 
copies per cell for all of the vectors. Therefore, differences in 
gene transfer do not explain the lower level of expression from 
L-CAT-SN compared to the level from MND-CAT-SN. 

Proviral methylation. It is clear that the modifications that 
we have incorporated into the MND vector backbone allow the 
provirus to be more resistant to the process(es) responsible for 
gene silencing than the standard Mo-MuLV LTR. One poten- 
tial epigenetic mechanism of gene silencing is DNA methyl- 
ation at cytosine residues. Gene silencing and de novo meth- 
ylation frequently occur in transcriptionally quiescent DNA 
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middle row (!.-]•;( iFP-SN) shows low-level Mill' expression driven In the L- 
liGFP s\ vector at the three lime points. The bottom ion (MND-EGFP-SN) 
shows high-level EGFP expression from the modified MND-EGFP-SN vector at 
all three time points. Ihe propoi lions of e el Is di i\ ins expression are given in the 
upper right comers of the dot plots. 



sequences, while transcriptionally active DNA tends to remain 
unmethylated (7, 42). Analysis by Southern blotting indicated 
that in ES and EC cells the unmodified Mo-MuLV LTR in the 
L-CAT-SN vector was more extensively methylated at the 
Smal site than was the modified MND-CAT-SN LTR. 

It is possible that the Spl site created during the divergence 
of MPSV and Mo-MuLV might play a role in reducing the 
extent of methylation in the MND LTR. Silke et al. (44) 
showed that the incorporation of Spl transcription factor bind- 
ing sites could induce substantial demethylation of target se- 
quences in F9 cells. Brandeis et al. (6) and Macleod et al. (29a) 
demonstrated the involvement of Spl sites in the demethyla- 
tion of sequences with transgenic mice. However, in vitro and 
in vivo, the presence of Spl sites alone was not sufficient to 
ensure complete demethylation of surrounding sequences. The 
results presented here support an association between meth- 
ylation and vector silencing but do not illuminate any potential 
cause-and-effect relationship between these events. 

Probability of expression. Although our studies with the 
CAT reporter vectors have demonstrated increased expression 
from the MND-CAT-SN vector in pools of transduced cells, 
the analysis of CAT expression in bulk populations obscures 
the contribution from individual cells. Walters et al. (49) sug- 
gested that promoters function as binary controls which are 



either "on" or "off" and that their relative strengths, measured 
in a pool of cells, are a result of the frequencies with which they 
are "on" and not the rate of transcription. In their model, 
enhancers act to increase the probability of forming an active 
transcription complex and, subsequently, the frequency of 
transcription from a given promoter determines the quantities 
of RNA produced in an individual cell or in a bulk population. 

To examine the effects of the MND vector modifications on 
the frequency of vector expression, we analyzed cell surface 
expression of rat tNGFR from a Mo-MuLV-driven construct, 
L-tNGFR-SN, and a modified vector, MND -tNGFR-SN, in 
pools of CCE ES cells and F9 EC cells. Flow cytometry allows 
the quantitation of reporter gene expression on individual 
cells; therefore, analysis of retrovirally transduced cell pools 
can provide an unbiased assessment of a vector's ability to 
overcome the effects of chromosomal position and gene -silenc- 
ing mechanisms in a given cell type. 

The modified MND-tNGFR vector drove expression in es- 
sentially all of the CCE and F9 cells, while the Mo-MuLV 
LTR -driven L-tNGFR-SN vector drove expression in only a 
small percentage of the CCE and F9 cells. However, the mag- 
nitudes of expression in the fraction of CCE or F9 cells which 
did stain positive for tNGFR were similar for the two vectors. 
Presumably, the L-tNGFR-SN proviruses that achieve expres- 
sion of tNGFR in ES and EC cells found infrequent, favorable 
integration sites. 

To further examine the effects of our vector alterations and 
allay concerns that our observations on the frequency of ex- 
pression were specific to the reporter construct or were influ- 
enced by G418 selection of the cell pools used in the analyses, 
we compared levels of EGFP expression in transduced but 
unselected F9 cells. As with the tNGFR reporter, expression of 
EGFP from the Mo-MuLV LTR was detectable in only a small 
percentage of the cells. In contrast, expression of EGFP from 
the modified MND LTR could be detected in nearly all of the 
cells. In accordance with the tNGFR results and the model of 
Walters et al. (49), the magnitude of EGFP expression mea- 
sured from the minority of cells which did show expression 
from the L-EGFP-SN vector was similar to the magnitude seen 
in the cells transduced with the MND-based vector. 

To explore the possibility that vector expression might vary 
with time posttransduction, we analyzed EGFP expression 
from L-EGFP-SN and MND-EGFP-SN shortly after transduc- 
tion and after 16 and 22 days in culture without selection. We 
found that the expression profile of the Mo-MuLV-based vec- 
tor was consistently poor (—4%). The probability of EGFP 
expression from the modified MND vector was also stable for 
the duration of the experiment, occurring in a large proportion 
of the cells (-80%). 

Therefore, neither the use of different reporters nor the time 
of analysis posttransduction nor selection of the target cells 
with G418 altered the outcome or interpretation of these ex- 
periments. The ability of the modified MND LTR to increase 
the probability of gene expression is transferable to different 
reporter constructs, is not dependent upon G418 selection, and 
does not appear to have a temporal expression window. 

The Mo-MuLV LTR appears to be sensitive to chromo- 
somal position effects, is prone to de novo methylation (which 
was correlated with vector inactivity), and has a low probability 
of expression in ES and EC cells. In contrast, the MND vector 
is relatively resistant to position effects and other repressive 
forces acting in ES and EC cells. Incorporation of other ele- 
ments, such as locus control regions or scaffold attachment 
regions may further insulate vectors from inhibitory positional 
effects. 
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Franz et al. (17) and Hilberg et al. (21) isolated the PCC4 
cell-passaged myeloproliferative sarcoma virus (PCMV), 
which was active in EC but not ES cells, by applying selection 
pressure for mutants of MPSV that grew in PCC4 EC cells. 
One of the major alterations of the MPSV LTR which char- 
acterizes the PCMV LTR is the loss of one of the 75-bp 
tandem enhancer repeats present in most viruses of this family. 
Grez et al. (19) demonstrated an expanded host cell range, 
including EC and ES cells, with the construction of their mu- 
rine embryonic stem cell virus (MESV) vector. The MESV 
vector contained the LTR from the PCMV and the altered 
PBS from the rf/587rev virus. The combined use of both mod- 
ifications was shown to be essential for improved vector ex- 
pression. 

The MND vector backbone described here is similar, in 
some respects, to that of the MESV in that the MND and 
MESV vectors have the same substitution of the PBS and 
repressor protein-binding site from the d/587rev endogenous 
retrovirus (11). The LTRs of MND and MESV come from, 
respectively, MPSV (25) and PCMV (17, 21). In the LTR of 
the MND vector, we deleted a 63-bp fragment (Nhel 33 - 
SflM3a 97 ) which spanned the NCR region as defined by Flana- 
gan et al. (16), completely eliminating the YY1 binding site. In 
the wild-type MPSV LTR, the YY1 binding site is disrupted by 
a point mutation; however, during the selection process that 
created the PCMV LTR, the YY1 site was restored to the 
consensus sequence and subsequently persists in the MESV 
vector. The relative effects of these different LTR alterations 
on expression are not known. 

Mo-MuLV silencers. The observation that greater quantities 
of the SV40 transcript were produced in cells containing the 
MND-CAT-SN vector than in those with L-CAT-SN suggested 
that negative elements within the L-CAT-SN vector may in- 
terfere with expression from the internal SV40 promoter as 
well as from the LTR. Experiments incorporating Mo-MuLV 
sequences (including but not restricted to the LTR) in gene 
cassettes used to produce transgenic mice showed that multiple 
sequence elements from Mo-MuLV (such as the PBS) are 
capable of interfering with or shutting down expression of 
functional transgenes when incorporated upstream or down- 
stream in transgene cassettes (12, 32). These results suggest 
that the MND backbone may be advantageous for vectors 
intended to express genes from internal promoters, such as 
those from housekeeping genes, e.g., phosphoglycerate kinase, 
or lineage-specific enhancers, such as the (3-globin locus. 

Since retroviral vectors integrate randomly and chromo- 
somal context can have profound positive or negative influ- 
ences on gene expression constructs including retroviral vec- 
tors, effective vectors must be refractory to positional silencing 
effects in vitro and in vivo (36, 51). The resistance of the MND 
provirus to methylation may improve the long-term perfor- 
mance of the construct. The dramatic increase in the proba- 
bility of expression with the MND vector could be particularly 
useful in cells which are difficult to transduce, such as HSC, or 
where the cumulative expression increase would be most evi- 
dent, e.g., for cell surface or secreted proteins and genes which 
provide no selective advantage. Augmented expression from 
the modified MND LTR might make the difference between 
detectable and undetectable experimental results in vitro or 
between therapeutic and nontherapeutic levels of gene expres- 
sion in vivo. Therefore, these findings are potentially important 
for many research or clinical applications using retroviral vec- 
tors. 
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